A MOTIVATED PROOF OF GORDON'S IDENTITIES 



JAMES LEPOWSKY AND MINXIAN ZHU 

Abstract. We generalize the "motivated proof" of the Rogers-Ramanujan identities given by 
Andrews and Baxter to provide an analogous "motivated proof" of Gordon's generalization of 
the Rogers-Ramanujan identities. Our main purpose is to provide insight into certain vertex- 
algebraic structure being developed. 



1. Introduction 



In [AB], G. Andrews and A. Baxter have provided an interesting "motivated proof" of the 
two Rogers-Ramanujan identities (among the large number of proofs in the literature), which 
we write in the form: 

II rh; = Ep^ m 

n>l, n^O, ±2 (mod 5) m>0 

and 



n 



i 



n>l, n^0,±l (mod 5) 



1 



m>0 



where 



and 



Pi(m) 



Pi{m) 



the number of partitions of m for which adjacent parts have 
difference at least 2 



the number of partitions of m for which adjacent parts have 
difference at least 2 and in which 1 does not appear, 

and where q is a formal variable. The idea was to start from the product sides, in which the 
difference-two condition is invisible, and to both motivate the expressions on the right-hand 
sides and prove the two identities, as follows: 

One subtracts the second product side, called G^g), from the first one, called G\(q), then 
one divides the difference by q, giving a formal series Gs(q), and then one forms G±{q) = 
(G 2 (q) - Gs(q))/q 2 . One repeats this process, giving Gi{q) = (GV 2 (<7) - / Y~ 2 for all 
i > 3. One notices empirically that for each i > 1, Gi(q) is a formal power series (that is, it 
involves only nonnegative powers of q), it has constant term 1, and Gi{q) — 1 is divisible by 
q % . This is the "Empirical Hypothesis" of Andrews-Baxter. Assuming its truth, one easily gets 
the two Rogers-Ramanujan identities. Then, with this as motivation, one proceeds to prove 
the Emprical Hypothesis directly from the product sides, thus proving the Rogers-Ramanujan 
identities. (In |ABj . q is taken to be a complex variable of absolute value less than 1, but in 
fact, the content of the argument is purely formal, and we shall take q to be a formal variable.) 

l 
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An initial motivation for the work in |AB] was to show from the product sides the highly 
non-obvious fact that the difference of the two product sides (in the same order as above) is a 
formal power series with nonnegative coefficients. This was Leon Eherenpreis's problem, and 
Andrews and Baxter gave a motivated proof of this fact as preparation for their motivated proof 
of the identities themselves. Also, as is recalled in |ABj . that proof of the identities is closely 
related to Baxter's proof of the identities, starting in [B], and moreover, that proof is also closely 
related to Rogers's and Ramanujan's proof in [RR] . 

In the present paper, we generalize the Andrews-Baxter "motivated proof" to give an analo- 
gous proof of Gordon's form ([G], [Alj ) of the Gordon- Andrews generalizations of the Rogers- 
Ramanujan identities, essentially in the form in which they are presented in Theorem 7.5 of [A2| , 
where for each k > 2 and i = 1, . . . , k, a suitable infinite product in q is expressed as a formal 
power series in q for which the coefficient of q m is the number of partitions of m such that parts 
at distance k — 1 have difference at least 2 and such that 1 appears at most k — i times; the case 
k = 2 is the pair of Rogers- Ramanuj an identities. (For a partition m = p± + • • • + p n of m with 
Pi > • • • > Pn > and for t > 1, saying that parts at distance t have difference at least 2 means 
that p s — Ps+t > 2 whenever s > 1 and s + t < n.) We do not address Andrews's multisum 
form of the sum sides, as presented in Theorem 7.8 of |A2j . While our proof is (necessarily) 
more complicated than the case k = 2, it is similar, although interesting new phenomena arise. 
Also, since we know what is going to happen, we take the liberty of identifying the appropriate 
analogue and generalization of the Empirical Hypothesis as our "Empirical Hypothesis," even 
though we did not observe its validity empirically before actually proving it directly from the 
product sides. Our proof includes (a variant of) the proof in [AB] as a special case. 

Our reason for wanting to work out such a proof stemmed from the relations between the 
Rogers- Ramanuj an identities, and generalizations, and what is now known to be vertex operator 
algebra theory, as follows: 

By retrospective analogy with the approach to the Rogers- Ramanuj an identities in [AB], 
the vertex-operator-theoretic proof of the Rogers- Ramanuj an identities along with the vertex- 
operator-theoretic interpretation of their Gordon-Andrews-Bressoud generalizations in [LW2J- 
[LW4] also started from the product sides as "given" [LMJ , and the problem at the time was to 
discover what turned out to be new structure, not previously anticipated, that would "explain" 
the sum sides (the difference- two condition and its variants). The result, based on the vertex 
operator theory whose discovery and development was the subject of those works, was the 
theory and application of "Z-algebras" — primarily twisted Z-algebras in [LW2| - [LW4| . and then 
untwisted Z-algebras in [LPj . — which later turned out to be understood in retrospect as the 
natural generating substructures of certain generalized vertex algebras, or abelian intertwining 
algebras, as developed in [DL], and twisted modules for them. (These Z-algebras, both untwisted 
and twisted, were also to arise as "parafermion algebras" in conformal field theory ([ZF1], 
|ZF2j).) In |LW2| - [LW4| . each identity was related to certain vertex-operator-theoretic structure 
constructed from a certain module for an affine Lie algebra, and the structures associated with 
different identities were not "compared" with one another, in the spirit of product sides being 
subtracted, etc.; the structures for the different identities were developed in parallel (with the 
proofs for the parallel structures certainly being closely related). The structures were based on 
twisted Z-operators, built starting from the twisted vertex operator in |LWlj . Later, a very 
different vertex- algebraic approach to the sum sides of the Rogers-Ramanujan and Gordon- 
Andrews identities was developed in [CLMlj . |CLM2j . [CalLMlj and [CalLM2j . this time based 
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on untwisted intertwining operators (in the sense of vertex operator algebra theory), and this 
time, indeed relating the family of different identities with the same "modulus" (the number 
2k + 1 in the notation above). In this work, the classical Rogers-Ramanujan recursion (q- 
difference equation) and Rogers-Selberg recursions had suggested what turned out to become 
certain systems of exact sequences constructed from untwisted intertwining operators among 
the "principal subspaces," in the sense of [FSlj -[FS2j. of different modules for certain vertex 
algebras, and it was this vertex-algebraic structure that was of primary interest. 

With this as background, we can now say that the initial reason for our interest in the 
motivated proof in [AB| is that that proof suggested to one of us (J.L.) and Antun Milas the 
potential new idea in vertex operator algebra theory to use twisted intertwining operators among 
twisted modules for suitable vertex-algebraic structures to develop new structure in the theory 
that would "re-explain" the identities from this new point of view. Such a program is underway. 
As was the case in the work mentioned above, the potential new structure suggested, this time, 
by the "motivated proof" is our main goal. 

In Section 2 we give our "motivated proof" of Gordon's identities, in Section 3 we reinterpret 
the sequence of equalities (|2.12p at the core of the proof, and in Section 4 we "explain" the 
meaning of this sequence of equalities. 

The series in q and z below are formal series (rather than convergent series in complex 
variables). 



2. The motivated proof 
Fix an integer k > 2. For each i = 1, . . . ,k, define 



^ = n rh- (2J) 

n>l, n^O, ±(fc+l-i) (mod 2k+l) 



Recalling the Jacobi triple product identity, 



= n( i -« 2n+2 )( i -^ 2n+i )( i -^ i ^ 2n+1 )' 

AeZ n>0 



2fc+l 2i— 1 

and replacing q by q 2 and z by q 2 1 we have 

1 + £A>l(-l) V 2fc+1)( 20 + ( ^ +1)A (l + g <*-DA) 

Gt - = rwwo (2 - 2) 

for i = 1, . . . , k. 

Define k — 1 further formal series . . . , G^ifc-i by 

G k ^ 1+l = Gfc - m g ~fi-* +2 (2.4) 
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for i = 2, . . . , k. Then for these new series, by A2.2H we have 
Gk-l+i 

^ A>i (_l)A g (2fc+l)(^)+ l A^ 1 + g (2fc-2z+l)A) _ J2 x>1 (-l) X q (2k+1) ^ + {l - 1)X (l + g( 2fc - 2i +3) A ) 
^ A>i (_l)A g (2fc+l)(^)+(i-l)A^A( 1 + g (2fc-M+l)A) _ (j + g (2fc-2i+3)A)] 

^rwi-^) 

^ A>i (_l)A+l g (2fc+l)(^)+(i-l)A^ _ g A)( X _ g (2fc-2*+2)A) 
^ (_l)Ag(2A+l)(^)+(2fc+i)A( 1 _ gA+l^j _ ff (fe-i+l)(2A+2)j 



nn>i(i-^ n ) 

for i = 2, . . . , k. In particular, Gfc_i + ,; € C[[g]] (that is, Gk-i+i is a formal power series), and its 
constant term is 1. 

Moreover, (j2.5j) remains valid for i = 1 as well, since the right-hand side for i = 1 agrees with 
(|2.3p for i = k. Indeed, when i = 1, the right-hand side of (|2.5p is 

£ A>o (_l)A g (2fc+l)Q)+(2fc+l)A (1 _ g A+l )(1 _ ? 2fc(A+l)) 

J rin>i(i-? n ) ' 

Breaking up the last factor in the numerator, we obtain two terms, the first of which can be 
rewritten as 

EA> (-i) A g (2fc+1)Q)+2fcA (g A -g 2A+1 ) 

Iln>l(l-<Z n ) 

After the re- indexing A — > A — 1, the second term becomes 

^ A>i (_l)A g (2A;+l)( A - 1 ) + (2A ; +l)(A-l)+2feA^ 1 _ g \j 

~^ n„>i(i-9") ' 

Noting that ( A 2^) + — 1) = (2) an d that allowing A = in the numerator only results in 
adding zero, we can combine the two terms to obtain (j2.3j) for i = k. That is, (j2.5[) holds for all 
i = 1,2,...,*;. 

In general, for j > 1 and i = 2, . . . , k, define the formal series 

„ _ C(fc-i)(j-i)+fc-i+i ~ G(fc-i)(j-i)+fc-i+2 , _v 

- ^-zi)- • ^-°) 

Theorem 2.1. For j > and i = I, . . . , k, G( k _i^j +i € C[[gJ] and in /aci 



(2.5) 



C(fc-l)j+i 



^ A>o (_l)A g (2fc+l)(^) + [fe(j+l)+i]A( 1 _ ? A+1) ...(!_ _ ? (*-i+l)(2A+i+l)) 

J n„>i(i-« n ) ' 

(2.7) 



In particular, denoting the right-hand side of {2. 7| ) by H^ k _iyj +i , we have that for each j > 1, 
the two expressions for G^j-j+i given by {2. 7| ) are equal: 



H 



(k-i)j+i 



(fc-l)0'-l)+*- 



(2.8) 



Proof. By ([231), 4ZZD holds for j = 0. By ([221) and the above, §F7§ and ([23]) both hold for 
j = 1. Take j > 1. Suppose that (]2.7[) holds for G^_i-jj +q , I < q < k. We will show that it 
holds for 1 < i < A;. 

First let i = 2, . . . , k. By the recursion (|2.6p . we have 



G 



__ ^(fc-l^+fc-i+l ~~ G( fc _ 1 )j +fc _ i+2 
(fe-l)(j+l)+i _ (i-l)(j+l) 



^ A>0 (_l)Ag(2fc+l)(*)+[*(j+l)+*-*+l]A( 1 _ ? A+1) ...(!_ g A+i)(1 - g *(2A+i+l)) 

^ (_l)A ? (2*+l)(J)+[*(j+l)+fc-*+2]A^ 1 _ g A+l) ...(!_ gA+i)(l _ g (i-l)(2A+j+l)) 

9 (i - i)W+i) n„>i(i-9 n ) 

We use the last factor in each of the two numerators to split each of the two summations into 
two sums, Yli = £11 ~ £i2> £2 = £21 ~~ £22' an d we combine the terms in a different way: 

(En -£21) + (-£12 + £ 22 )- " 

Now J2n ~ £21 ec l ua l s 



^^ o (_l)A g (2fe+l)Q)+[feO-+l)+fc-i+l]A (1 _ g A )(1 _ ? A+1) ...(!_ g A+i) 



(2.9) 



Note that the A = term in (|2.9|) vanishes, so the summation is actually over A > 1. Making 
the index change A — > A + 1 we obtain 

^ A>o (_l)A+l g (2fc+l)(^) + [fc(j+2)+2fe-i+2]A+fe(j+l)+fc-i+l^ 1 _ q \+l^ ...(]__ q X+j+l^ 

^ A>o (_l)A+lg(2fc+l)(2) + [fe(i+2)+i]A+(fc-i+l)(j+2) ( ^ 1 _ q \+lj g \+j+lj q (2k-2i+2)\ 

rin>i(i-? n ) ' 

Similarly, — + YI22 ec l ua l s 

^ (_l)A ? (2fc+l)Q)+[fcO'+l)+fc-H]A+(i-l)(j+l)( 1 _ ^A+l^ 



(2.10) 



(1 



7 A+i)(_ g A+j+l + 



g (i-D(j+D n„>i(i-9-) 

£ A>0 (-l)\( 2fe+1 K2)+[^(i+2)+^ (1 _ g A+l) . . . (1 _ g A +i)(1 _ q 



U n >l^-Q n ) 



(2.11) 



(i 
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Combining (pTO]) and (f2TTTj) we get 

£ (_l)Ag( 2fc + 1 )(2)+[ fc O'+ 2 )+*1 A (l _ q \+l^ q ^+j+^l _ q (k-i+l)(2\+j+2)j 

G (k - m+ i) + i = — rwi-9") : 

proving (|2.7p for the case j + 1 and i = 2, . . . , k. 

For the case j + 1 and i = 1, we observe that (|2.7p follows from the induction hypothesis and 
(|2.8p for the case j + 1, and (|2.8p in turn follows (for any j) by virtually the same argument as 
the one above for j = 1. □ 

Theorem 12.11 implies that for j > 0, 

1 _ g (fc-i+i)(?+i) 
(l- 9 J+i)(l- g i+2)... 

^ (_l)A g (2fe+l)Q)+[fe(j+l)+i]A^ _ g A+l) ...(!_ g*+i)(l - g (*-i+l)(2A+i+l)) 

+ ~^ n„>i(i-9 n ) ' 

= 1 + ^' +1 7? +1) (?) if 1 < * < * - 1 
or i + g j+2 7l i+2) (g) ift = Jfe, 

where 

7?(?)eC[i. 

This is our "Empirical Hypothesis," in the sense explained in the Introduction. 

Using (|2,6p in the form (|2.17p below together with (I2.18P below, we write each Gj, 1 < i < k, 
in terms of G^_iy + i, . . . , G(fc_i)j + & for each j = 0, 1, 2, ... , giving a sequence of expressions 
(one for each j) for G±, . . . , of the form 

Gi = i/4 J ^(fc-i)j+i + • " + ih^ G(fc_i)j4-fc, (2.12) 

where for each j the coefficients ihj form akxk matrix of polynomials in q with nonnegative 
integral coefficients. More explicitly, define row vectors 

= ' , . . . , i^'^]- 

For j = we have 

ih(°> = [0,...,1,...,0], 

with 1 in the i-th position, so that b.(°) is the identity matrix. The jh^ satisfy the same set of 
recursions with respect to j, independently of i. Explicitly: 

Proposition 2.1. Let j > 1. With the left subscript i suppressed, we have 

h? = ^ 1) +---+Ci 1) +C 1) 

hf = (^'-D + .-. + ^-Dy 
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or in general, 



h 



U) 



O-i) 



1< I < Jfc. 



In matrix form, this is: 



h (j) = h C?'-i)A 



0')' 



(2.13) 
(2.14) 



TOi/i h our k x k matrix defined above and with 



L (i) 



q> 
q> 




g2j 






in particular, 

for all j > 0. 
Proo/. By {53 



(1)^(2) 







Hi) 



(2.15) 



<2(k-l)(j--i)+l 



G 



(fe-i)O'-i) 



(fc-l)j+fc-Z+l 



(2.16) 



(2.17) 



for _7>l,Z = l,... 3 fc — 1, and the lemma follows from the repeated application of this formula 
together with the tautological fact that 



G 



(fc-i)(j-i)+fc 



G(k-i)j+i- 



(2.18) 
□ 



In the course of the vertex-algebraic interpretation of the Rogers- Selb erg recursions in [CLM2J, 
it was implicitly noticed that matrices analogous to the matrices Ay) along with their inverses, 
involving the two variables in those recursions, could be used to reformulate those recursions. 
Such matrices indeed arise naturally from recursions of these types. 

Proposition 2.2. For each j > 1 and i, I = 1, . . . , k, the polynomial ih^ € C[q] is the generating 
function for partitions with difference at least 2 at distance k — 1 such that 1 appears at most 
k — i times, such that the largest part is at most j, and such that j appears exactly I — 1 times. 

Proof. It is sufficient to show that the combinatorial generating functions described here have 
the same initial values and recursions as the polynomials ih ) . We say that a partition is of 
type (k — 1, k — i) if it has difference at least 2 at distance k — 1 and 1 appears at most k — i 
times. Then (|2. 13[) corresponds to the following combinatorial fact: For j > 2, 

the number of partitions of m of type (k — 1, k — i) such that the largest part is at most 
j and such that j appears exactly / — 1 times 



k-l+l 

E 

P =i 

largest part is at most j 



the number of partitions of m — (I — l)j of type (k — 1, k — i) such that the 

1 and such that j — 1 appears exactly p — 1 times. 
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The initial values 



M 1] = [l,q,q 2 ,--- ,q k ~\0,--- , 0] 



also match those of the generating functions. 
Recall the products Gi in (|2.ip . 



□ 



Theorem 2.2. For 1 < i < k, Gi is the generating function for partitions with difference at 
least 2 at distance k — 1 such that 1 appears at most k — i times. 

Proof. This follows immediately from (|2,12p . Proposition 12.21 and the Empirical Hypothesis. □ 

This result constitutes Gordon's identities, as formulated in Theorem 7.5 of [A2J; the Rogers- 
Ramanujan identities form the special case k = 2. 

3. Matrix interpretation 

The right-hand side of (|2,12p suggests a product of matrices, and the recursions for the 
come from the recursions (|2.17p (or equivalently, ()2.6[) ) for the G s , s > 1, together with (|2,18p . 
We now express all of this in matrix form, and in the process we quickly rederive Proposition 
12.11 obtaining the matrices h™' and their properties from (|2,17p and (I2.18p . 

Set 

" G 1 



'(o) 



Gk 



and in general, 



'U) 



G 



G 



for j > 0. Also set 



B 



0') 













o 






(fc-i)i+i 
(k-i)j+k _ 

o 
o 



-2.; 



-2/ 








q v -> J -q 

for j >1. Then §2M)) (or equivalently, flgTTD ) and (pT8l) assert that 



'U) 



B (i) G (i-i) 



(3.1) 



for j > 1, so that 

for j>0. But 

(recall (|2.15p ). which gives 



r (i) 



B (i) B (i-i) 



B (1) G (Q) 



B (i) 



(AO-))" 1 



A 0') G (i) 



G (i-i) 
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for j > 1 and 

G( ) = A (1) A (2) ••• A (i) G (i) 
for j > 0. Defining h^' recursively by 

= identity matrix, 
h (i) = h&'-^Ay) 
for j > 1 (cf. (EH])), we have ([2TT6]) along with (f2TT2"]) . in the form 

G (0) = hWGy, 

for each j > 0. Thus from ()2. 17j) and (|2.18p expressed in matrix form, we have an "automatic" 
reformulation and proof of Proposition 12.11 along with (j2. 12|) . 

The combinatorial interpretation of the entries of h.C?) in Proposition 12.21 is a separate matter, 
as is the combinatorial interpretation of the G s , s > 1, given in Theorem 14.11 below. 

4. Interpretation of the sequence of expressions for Gi 

All of the formal power series G s for s > 1 can be interpreted as combinatorial generating 
functions, and this will allow us to "explain" the meaning of the sequence of equalities (I2.12p 
for d, ...,G k - 

We start with the expression of G\ , . . . , G k as the generating functions given by Theorem 12.21 
The recursions (|2.17|) , or equivalently, (I2.6p , determine all the G s for s > 1 from G\ , . . . , Gk , 
and we know from Theorem 12.11 that all the G s , s > 1, are formal power series. 

We have the following "complement" to Proposition 12.21 reflecting and illustrating the com- 
plementary nature of the recursions (12.130 and (12.170 (or equivalently, of (|2.14j) and (|3.1j) ): 

Theorem 4.1. For j > and I = 1, . . . , k, the formal power series G^~i)j+i is the generating 
function for partitions with difference at least 2 at distance k — 1 such that the smallest part is 
greater than j and such that j + 1 appears at most k — I times. 

Proof. It is sufficient to show that these combinatorial generating functions have the same initial 
values and recursions as the formal power series G^ k _^j + i. The recursion (|2.17p for j > 1 and 
I = 1, . . . , k — 1 corresponds to the following combinatorial fact: For j > 1 and I = 1, . . . , k — 1, 

the number of partitions of m with difference at least 2 at distance k — 1 such that 

the smallest part is at least j and such that j appears exactly k — I times 

= the number of partitions of m — (k — l)j with difference at least 2 at distance 

k — 1 such that the smallest part is greater than j and such that j + 1 appears 

at most I — 1 times. 

By Theorem [22] (the case j = 0), these assertions prove the result. (Note that the combinatorial 
interpretations of the two expressions equated in (|2.18p indeed agree.) □ 

For j = 0, (12.120 says simply that Gi = Gi, and for j > 1, combining Proposition 12.21 and 
Theorem 14. II we immediately have: 

Theorem 4.2. For I = 1, . . . , k and j > 1, the right-hand side of Ii2.12\) expresses the generat- 
ing function Gi as the sum of its contributions corresponding to the number of times, namely, 
0, 1, . . . , k — 1, that the part j appears in a partition. 
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Remark 4.1. With the benefit of the picture that has emerged, we can give an alternate, shorter 
proof of Theorem 12.21 (Gordon's identities), without needing Proposition 12.11 or 12.21 using the 
following uniqueness observation: Let Ji, J2, . . . be a sequence of formal power series in q with 
constant term 1 satisfying the recursions (I2.17P for j > 1 and I = 1, . . . , k — 1 (with J in place 
of G), and suppose that the Empirical Hypothesis holds for Ji, J2, . . . . The comments above 
in connection with (I2.12P give a sequence of expressions of the form fl2. 12f) (with J in place of 

G), with the coefficients the same polynomials as in (|2.12p (and now, we do not have to 
compute them). By the Empirical Hypothesis, the k formal power series J%, . . . , are uniquely 
determined, and thus so is the whole sequence Ji, J2, . . . . But by the proof of Theorem 14.11 the 
combinatorial generating functions defined in the statement of Theorem 14.11 form a sequence 
K±, K2, ... of formal power series with constant term 1 satisfying the recursions (12. 17h . and the 
Empirical Hypothesis trivially holds for K±,K2, ■ ■ ■ ■ Thus by the uniqueness, J s = K s for each 
s > 1. Then by Theorem 12.11 which gives the Empirical Hypothesis for G%, G2, ■ ■ ■ , we have 
(without using Theorem 12. 2D that G s = K s for each s > 1, and this statement for s = 1, . . . , k 
constitutes Gordon's identities. This remark generalizes the corresponding alternate proof of 
the Rogers- Ramanuj an identities discsussed in [ABj . [R] and [A3) . 
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